Reinforcement Learning for Argumentation: Describing a PhD Research
نویسندگان
چکیده
Artificial intelligence (AI) is increasingly studied in many fields such as philosophy, law and decision making. One of the approaches to AI is the use of agent and multi-agent systems. Agents are key element for building complex largescale distributed systems[9]. In multi-agent systems, each agent interacts with the environment and communicates with other agents in order to achieve the designated goal. Communication means to share and exchange information, cooperate and coordinate with each other in order to achieve a common goal. Argumentation is a type of communication between agents and a process attempting to form an agreement about what to believe. There has been increasing research in argumentation and dialogue systems in the past decade[23]. The agent as a dialogue participant needs sophisticated dialogue strategies in order to make high quality dialogue contributions. By reviewing the state of art literature in computerised dialogue systems (e.g.[21];[22]), it is observed that their dialogue strategies (i.e. strategic heuristics) are hardwired into the computational agent. One of the main issues with this is that an agent might be incapable of dealing with new dialogue situations that have not been coded, and indeed this is an impossible task given the dynamic nature of argumentation. It would be ideal to make an agent search for an optimal strategy by itself e.g. via trial and error, and thus the agent with the best strategy will win the argument [8]. Machine learning has an important role to play in order to meet these challenges. To make agents learn the dialogue strategies, it would be more flexible for them to make an argument through exploration (trial and error). It is believed that learning can make agents more flexible to adapt to new environments and new dialogue situations. One of the popular machine learning approaches with regards to learning agents is known as reinforcement learning (RL). Reinforcement learning focuses on how to map an action for each state by interacting with the environment and observing the state change[15]. Sutton and Barto[15] define reinforcement learning as an agent learning what to do and how to connect each situation with an action to maximise the cumulative reward. The learner or agent is not told what action should be taken, rather the learner needs to explore a policy that yields the maximum cumulative reward by trying them out. In reinforcement learning, the agent interacts with the environment by taking an action and receiving a reward for the action taken as seen in figure 1. To make an
منابع مشابه
Argumentation-based reinforcement learning for robocup soccer takeaway
Reinforcement Learning (RL) is widely regarded as a generic and effective technique to learn coordinated behaviours in cooperative multi-agent systems (CMAS), but it suffers from slow convergence speed due to the huge joint action space. Incorporating domain knowledge has shown to be an effective method to tackle this problem, but little research has investigated how to propose high-quality heu...
متن کاملArgumentation Accelerated Reinforcement Learning for RoboCup Keepaway-Takeaway
Multi-Agent Learning (MAL) is a complex problem, especially in real-time systems where both cooperative and competitive learning are involved. We study this problem in the RoboCup Soccer KeepawayTakeaway game and propose Argumentation Accelerated Reinforcement Learning (AARL) for this game. AARL incorporates heuristics, represented by arguments in Value-Based Argumentation, into Reinforcement L...
متن کاملArgumentation accelerated reinforcement learning
Reinforcement Learning (RL) is a popular statistical Artificial Intelligence (AI) technique for building autonomous agents, but it suffers from the curse of dimensionality: the computational requirement for obtaining the optimal policies grows exponentially with the size of the state space. Integrating heuristics into RL has proven to be an effective approach to combat this curse, but deriving ...
متن کاملReinforcement Learning on Multiple Correlated Signals
This extended abstract provides a brief overview of my PhD research on multi-objectivization and ensemble techniques in reinforcement learning.
متن کاملArgumentation Accelerated Reinforcement Learning for Cooperative Multi-Agent Systems
Multi-Agent Learning is a complex problem, especially in real-time systems. We address this problem by introducing Argumentation Accelerated Reinforcement Learning (AARL), which provides a methodology for defining heuristics, represented by arguments, and incorporates these heuristics into Reinforcement Learning (RL) by using reward shaping. We define AARL via argumentation and prove that it ca...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2017